# Low Error Rate

Vntl Llama3 8b V2 Imatrix Gguf
QLoRA fine-tuned version based on LLaMA3 Youko, optimized for Japanese visual novel English translation with 8B parameters
Machine Translation Supports Multiple Languages
V
Casual-Autopsy
311
1
Reverb Diarization V2
Other
Reverb Speaker Diarization V2 is a speaker diarization model based on pyannote-audio, outperforming the baseline pyannote3.0 model on multiple test sets.
Audio Processing
R
Revai
4,073
45
Trocr Base Printed License Plates Ocr
A fine-tuned printed license plate OCR model based on microsoft/trocr-base-printed, with a character error rate of 0.037 on the evaluation set
Text Recognition Transformers
T
artbreguez
163
1
Fine Tashkeel
MIT
An Arabic precise diacritization system based on byte-level fine-tuned models, automatically completing Arabic text diacritics by fine-tuning pre-trained models.
Large Language Model Transformers Arabic
F
basharalrfooh
335
5
Wavlm Base 960h Asv19 Deepfake
A deepfake audio detection model fine-tuned based on Microsoft's WavLM-base, achieving excellent performance on the ASVspoof 2019 dataset with an accuracy of 99.79%
Audio Classification Transformers
W
abhishtagatya
16
0
Hubert Base 960h Asv19 Deepfake
Apache-2.0
An audio classification model based on the HuBERT architecture, specifically designed for detecting deepfake audio and audio spoofing
Audio Classification Transformers
H
abhishtagatya
15
2
Belle Whisper Large V3 Zh
Apache-2.0
A Chinese speech recognition model fine-tuned and optimized based on whisper-large-v3, showing significant performance improvements in multiple Chinese speech benchmarks
Speech Recognition Transformers
B
BELLE-2
1,666
112
Trocr Large Spanish
MIT
Transformer-based OCR model for Spanish printed text, optimized for printed fonts and does not support handwriting recognition
Image-to-Text Transformers Supports Multiple Languages
T
qantev
298
11
Trocr Base Printed License Plates Ocr
An OCR model fine-tuned based on microsoft/trocr-base-printed, specifically designed for recognizing printed license plate numbers.
Text Recognition Transformers
T
mariovigliar
202
1
Trocr Base Printed License Plates Ocr Timestamp
An OCR model fine-tuned based on microsoft/trocr-base-printed, specifically designed for recognizing license plates and timestamp information
Text Recognition Transformers
T
PQAshwin
132
1
Wespeaker Voxceleb Resnet293 LM
A speaker embedding model based on ResNet293 architecture, optimized with large margin fine-tuning, supporting tasks such as speaker recognition, similarity calculation, and speech segmentation
Speaker Analysis English
W
Wespeaker
108
3
Whisper Large V3 German
Apache-2.0
A fine-tuned German speech recognition model based on Whisper Large v3, optimized for German speech processing and recognition
Speech Recognition Transformers German
W
primeline
8,745
70
Trocr Base Printed Captcha Ocr
A captcha recognition model fine-tuned based on Microsoft's trocr-base-printed model, specifically designed for OCR tasks involving printed text
Text Recognition Transformers
T
chanelcolgate
33
1
Whisper Base Japanese
Apache-2.0
This model is fine-tuned on the Common Voice, JVS, and JSUT datasets for Japanese speech recognition tasks using openai/whisper-base.
Speech Recognition Transformers Japanese
W
Ivydata
137
3
Trocr Handwritten Math
This model can convert images of handwritten mathematical expressions into corresponding LaTeX sequences, suitable for mathematical formula recognition and digital processing.
Text Recognition Transformers
T
Azu
46
5
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase